Distributed Graph Layout for Scalable Small-world Network Analysis
نویسندگان
چکیده
The in-memory graph layout or organization has a considerable impact on the time and energy efficiency of distributed memory graph computations. It affects memory locality, inter-task load balance, communication time, and overall memory utilization. Graph layout could refer to partitioning or replication of vertex and edge arrays, selective replication of data structures that hold meta-data, and reordering vertex and edge identifiers. In this work, we present DGL, a fast, parallel, and memory-efficient distributed graph layout strategy that is specifically designed for small-world networks (low-diameter graphs with skewed vertex degree distributions). Label propagation-based partitioning and a scalable BFS-based ordering are the main steps in the layout strategy. We show that the DGL layout can significantly improve end-to-end performance of five challenging graph analytics workloads: PageRank, a parallel subgraph enumeration program, tuned implementations of breadth-first search and single-source shortest paths, and RDF3XMPI, a distributed SPARQL query processing engine. Using these benchmarks, we additionally offer a comprehensive analysis on how graph layout affects the performance of graph analytics with variable computation and communication characteristics.
منابع مشابه
Spectral Routing in Small-World Networks
The small-world property is one of the most common and well-known features of complex networks: those networks are heavily clustered, yet any two nodes are very often connected by a short number of links. Usually, models that create networks with small-world properties are based on the idea that nodes have some kind of reciprocal a nity. Nodes are placed on a k-dimensional layout, where close n...
متن کاملA Scalable Parallel Force-Directed Graph Layout Algorithm
Understanding the structure, dynamics, and evolution of large graphs is becoming increasingly important in a variety of fields. The demand for visual tools to aid in this process is rising accordingly. Yet, many algorithms that create good representations of small and medium-sized graphs do not scale to larger graph sizes. The exploitation of the massive computational power provided by parallel...
متن کاملSpeeding up Network Layout and Centrality Measures with NodeXL and the Nvidia CUDA Technology
In this paper we talk about speeding up calculation of graph metrics and layout with NodeXL by exploiting the parallel architecture of modern day Graphics Processing Units (GPU), specifically Compute Unified Device Architecture (CUDA) by Nvidia. Graph centrality metrics like Eigenvector, Betweenness, Page Rank and layout algorithms like Fruchterman-Rheingold are essential components of Social N...
متن کاملStorage and Processing Systems for Power - Law Graphs
Large graphs abound around us – online social networks, Web graphs, the Internet, citation networks, protein interaction networks, telephone call graphs, peer-to-peer overlay networks, electric power grid networks, etc. Many reallife graphs are power-law graphs. A fundamental challenge in today’s Big Data world is storage and processing of these large-scale power-law graphs. In this thesis, we ...
متن کاملDistributed Dynamic Graph Analytic Framework: Scalable Layered Multi-Modal Network Analysis
Dynamic Graph Analytic Framework (DYGRAF) is a domain agnostic framework from which data alignment, data association, and layered multi-modal network analysis can be performed. Past installments of DYGRAF have been able to provide analytic insight for small and medium-sized data sets, however scalability becomes problematic as the amount of collected data increases. In this work, we discuss ext...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1701.00503 شماره
صفحات -
تاریخ انتشار 2017